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Abstract 



We study three variants of multi-prover quantum Merlin-Arthur proof systems. We first 
show that the class of problems that can be efficiently verified using polynomially many quan- 
tum proofs, each of logarithmic-size (denoted QMAi og (poly)), is exactly MQA, the class of 
problems which can be efficiently verified via a classical proof and a quantum verifier. Assum- 
ing BQP 7^ MQA, this achieves the separation QMA log (1 ) / QMA log (poly) . We then study the 
class BellQMA(poly), characterized by a verifier who first applies unentangled, nonadaptive 
' measurements to each of the polynomially many proofs, followed by an arbitrary but efficient 

quantum verification circuit on the resulting measurement outcomes. We show that if the num- 
ber of outcomes per nonadaptive measurement is a polynomially-bounded function, then the 
expressive power of the proof system is exactly QMA. Finally, we study a class equivalent to 
QMA(m), denoted SepQMA(m), where the verifier's measurement operator corresponding to 
O" 1 outcome accept is a fully separable operator across the m quantum proofs. Using cone pro- 

gramming duality, we give an alternate proof of a result of Harrow and Montanaro [FOCS, p. 
633-642 (2010)] that shows a perfect parallel repetition theorem for SepQMA(m) for any m. 



1 Introduction and summary of results 



> 

o 

The study of classical proof systems has yielded some of the greatest achievements in theoretical 
computer science, from the Cook-Levin theorem llCoo71t ILev73L which formally ushered in the 
age of NP verification systems and the now ubiquitous notion of NP-hardness, to the more modern 
PCP theorem llAS98llALM + 98ll , which has led to significant advancements in our understanding 
of hardness of approximation. A natural generalization of the class NP, or more accurately its 
probabilistic cousin Merlin-Arthur (MA), to the quantum setting is the class quantum Merlin- 
Arthur (QMA) IIKSV02L where a computationally powerful but untrustworthy prover, Merlin, 
sends a quantum proof to convince an efficient quantum verifier, Arthur, that a given input string 
x £ {0, 1}" is a YES-instance for a specified promise problem. 

More specifically, a QMA proof system for a given promise problem A is characterized by the 
following properties (see Section I2TT1 for formal definitions): 

• For every YES-instance x of A, there exists a polynomial-size quantum proof which can 
convince Arthur of this fact with high probability, with the smallest such success probability 
over all YES-instances called the completeness of the protocol. 
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• For every NO-instance x of A and for any purported quantum proof, Arthur rejects with 
high probability, with the maximum success probability over all NO-instances called the 
soundness of the protocol. 

It is easy to see that QMA proof systems are at least as powerful as NP or MA, since the abil- 
ity to process and exchange quantum information does not prevent Arthur from choosing to act 
classically. 

Much attention has been devoted to QMA over recent years. We now have a number of prob- 
lems which are complete for QMA (see e.g. IBra06l ILiu06l IBS071 ILCV071 ISV091 |JGL10t IWMNlOl 
IRoslllO , with the quantum analogue of classical constraint satisfaction, the physically motivated 
fc-local Hamiltonian problem 1IKSV021 IKR031 IKKR061 IOT081 1 AGIK09II . being the canonical QMA- 
complete problem. In analogy with NP-complete problems, it is tempting to think of QMA- 
complete problems as hard even for a quantum computer to solve, though this is somewhat of 
a misnomer as even NP-complete problems are generally believed to be intractable for quantum 
computers. QMA is an extremely robust complexity class that satisfies strong error-reduction 
properties, and using these properties one can, e.g., give a very elegant and simple proof that 
MA C QMA C PP (the first containment follows trivially from the definition) IMW05I . However, 
there still remain important open questions — for example, despite the fact that MA is contained 
in the polynomial hierarchy (PH) [AB09J, we do not know whether QMA C PH. 

An approach for understanding a complexity class is to consider how introducing variations 
to its definition changes its properties. In this paper, we thus ask: How does allowing multiple unen- 
tangled provers affect the expressive power of QMA? In particular, we are interested in variants of the 
class QMA(poly), a.k.a. quantum Merlin-Arthur proof systems with polynomially many Merlins, 
where the verifier receives a polynomial number of quantum proofs, which are promised to be 
unentangled with each other. Note that the classical version of this class collapses trivially to MA, 
as the set of potential strategies of a single Merlin and the set of potential strategies of multiple 
Merlins coincide. This logic fails, however, in the quantum case, as a single Merlin simulating the 
action of multiple Merlins can try to cheat by entangling the multiple proofs. Despite much effort, 
very little is known (more details under Previous Work below) about the structural properties of 
QMA(poly), except for the obvious containments QMA C QMA(poly) C NEXP. 

Our results: We show the following three results regarding variants of QMA (poly) . 

1. Relationship to MQA. We first show that restricting the definition of QMA (poly) so that each 
prover's proof is at most a logarithmic number of qubits (a class which we henceforth denote as 
QMA log (poly) ) collapses it to the class MQA, where MQA is defined^ as QMA except that Merlin's 
proof is a polynomial-size classical string. In other words, if each prover is restricted to sending 
short quantum proofs, then one can not only do away with multiple provers, but also of the need 
for quantum proofs altogether. Specifically, we show: 

Theorem 1.1. QMA log (poly) = MQA. 

The significance of this result is as follows: Understanding the expressive power of QMA(poly), 
or its relationship with QMA, is currently one of the biggest challenges in quantum complexity 
theory. (It was recently shown that, in fact, QMA(poly) = QMA(2) IHMlOj , where QMA(2) is 
QMA with two unentangled provers.) Theorem 11.11 settles this question in the logarithmic-size 

x Note: MQA has been studied 1 AN02I |jW06l IAar061 IAK071 IBei08l IABOBS08I IWY081 under the name QCMA in the 
literature — the notation MQA was suggested by Watrous |W at09| . 
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message setting. Moreover, it immediately implies the following separation, where QMA log is 
QMA restricted to a logarithmic-size proof: 

Corollary 1.2. 7/ BQP ^ MQA, then QMA log ^ QMA log (poly) . 

This follows simply because QMA log = BQP IIMW05L Note that the assumption BQP ^ MQA is 
reasonable, as otherwise quantum computers could efficiently solve NP-complete problems. 

2. Towards a non-trivial upper bound on BellQMA(poly). Another approach to studying the 
question of whether QMA = QMA(poly) is to understand the properties of restricted versions of 
QMA(poly), and this is precisely where the class BellQMA(poly) comes into play. BellQMA(poly) 
is defined RBra08l lABD+091 ICD10H analogously to QMA (poly), except that before applying his 
quantum verification circuit to the polynomially many unentangled quantum proofs, Arthur must 
measure each proof using a nonadaptive and unentangled (across all proofs) measurement (we 
call this Stage 1 of the verification). He then feeds the resulting classical outcomes induced by these 
measurements into his arbitrary efficient quantum circuit (we call this Stage 2). This quantum 
circuit implements a two-outcome measurement operation corresponding to outcomes accept and 
reject. 

The significance of BellQMA(poly) in our setting is that if one could show that QMA ^ 
BellQMA(poly), then it would follow that QMA ^ QMA(poly), since QMA C BellQMA(poly) C 
QMA (poly). To this end, Brandao has shown the negative result that QMA = BellQMA (m) for 
constant m l|Bra08L Where BellQMA(poly) lies, however, remains open. For example, although 
we know QMA(2) = QMA(poly) IIHM10L the same techniques do not apply in any obvious 
way to show an analogous result BellQMA(2) = BellQMA(poly) as they require entangled mea- 
surements (i.e. SWAP test measurements) across multiple proofs, which violate the definition of 
BellQMA. 

To make progress on BellQMA(poly), we introduce the class BellQMA[r, m], which is defined 
as BellQMA (m) with m provers and the additional restriction that in Stage 1 above, the num- 
ber of outcomes per proof in Arthur's nonadaptive measurements is upper bounded by r. Our 
contribution is the following: 

Theorem 1.3. For any polynomially bounded functions r, m : N — > N, BellQMA [r, m] = QMA. 

In other words, BellQMA (poly) cannot be used to show QMA ^ QMA (poly) if the verifier in the 
BellQMA (poly) protocol is restricted to have a polynomially bounded number of measurement 
outcomes per proof in Stage 1. We remark that, in general, the number of such measurement out- 
comes can be exponential in the input length — the restriction that r be a polynomially bounded 
function is crucial for the proof of Theorem 11.31 For this reason, our result complements, rather 
than subsumes Brandao's result BBra08L In other words, in our notation, Brandao has shown that 
BellQMA [exp, const] = QMA, and we show BellQMA [poly, poly] = QMA. 

Readers should note that we allow the second stage of the verification procedure to be quantum, 
as per the definition suggested by Chen and Drucker IICD10L as opposed to classical, as studied by 
Brandao [Bra08J. We remark that the conclusion of Theorem 1 1 .31 holds even if the second stage of 
verification is completely classical. 

Finally, it is worth noting that by combining Theorems 11.11 and 11.31 we conclude that in the 
setting of BellQMA (poly), if MQA ^ QMA, then having the Merlins send logarithmic-size proofs 
without any restriction on the number of local measurement outcomes of Arthur in Stage 1 has less 
expressive power than sending polynomial-size proofs but restricting the number of outcomes, 
even though the number of measurement outcomes in Stage 1 per Merlin in both cases is the 
same, i.e. polynomial in the input length. 
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3. Perfect parallel repetition for SepQMA(m). A key question in designing proof systems is 
how to improve the completeness and soundness parameters of a verification protocol without 
increasing the required number of rounds of communication. A natural approach for doing so is 
to repeat the protocol multiple times in parallel. With QMA, however, this raises the concern that 
Merlin might try to cheat by entangling his proofs across these parallel runs. If, though, perfect 
parallel repetition holds, it means that for any input string x, if the verification procedure V accepts 
with probability p(|x|), then if we run V k times in parallel, the probability of accepting in all 
k runs of V is precisely . Note that we do not put any restriction on the quantum proof, 

which can be entangled across the k executions of the protocol. In other words, if perfect parallel 
repetition holds, there is no incentive for Merlin to cheat — an honest proof which is a product 
state across all k runs achieves the maximum success probability. 

Our final contribution is an alternate proof of a perfect parallel repetition theorem for a class 
which is equivalent DHM10II to QMA(m), namely SepQMA(m). The theorem was first proved in 
Harrow and Montanaro [HMTO] in connection with an error reduction technique for QMA (poly). 
However, our proof is significantly different from theirs and uses the cone programming char- 
acterization of QMA(poly). Here SepQMA(m) is defined as QMA(m) with the restriction that 
Arthur's measurement operator corresponding to acceptance is a separable operator across the m 
unentangled proofs. (Note that this does not imply that Arthur's measurement operator corre- 
sponding to rejection is also separable.) We show: 

Theorem 1.4 (see IIHM10II for alternate proof). The class SepQMA(m) admits perfect parallel repeti- 
tion. 

Our alternate proof of Theorem 11.41 is significant in that, to the best of our knowledge, it is the 
first use of duality theory for a cone program other than a semidefinite program to establish a par- 
allel repetition result (note that cone programming generalizes semidefinite programming). We 
remark that semidefinite programs have been previously used to show perfect or strong parallel 
repetition theorems for various other models of (single or two-prover) quantum interactive proof 
systems BCSUU081 IKRTlOi IGut09H . and that the alternate proof of Theorem O of Harrow and 
Montanaro is not based on semidefinite programming. Perfect parallel repetition for SepQMA(m) 
in itself is interesting, as it has been used to show that error reduction is possible for QMA(ra) 
proof systems BHM10L 

Proof ideas and tools: The proof of our first result, Theorem 11.11 is a simple application of the 
fact that quantum states of logarithmic-size can be described to within inverse exponential pre- 
cision using a polynomial number of bits and efficiently prepared by a quantum circuit. Hence, 
roughly speaking, one can replace a polynomial number of logarithmic-size quantum proofs with 
a single polynomial size classical proof. Each quantum proof can then be efficiently prepared 
from this classical information by the verifier to within inverse exponential precision since the 
original quantum proofs were of logarithmic-size. Although the proof is simple, one cannot 
hope for a better characterization using other techniques because the reverse containment, i.e. 
MQA C QMAi og (poly), also holds using similar ideas. 

More technically challenging is our second result, Theorem 11.31 To show the containment 
BellQMA [poly, poly] C QMA (note that the reverse containment QMA C BellQMA [poly, poly] 
is trivial since QMA C BellQMA [2, 1]), we demonstrate a QMA protocol which simulates an ar- 
bitrary BellQMA [poly, poly] protocol using the following observation: although consolidating m 
quantum proofs into a single quantum proof raises the possibility of cheating using entanglement, 
if Arthur is also sent an appropriate classical "consistency-check" string, then a dishonest Merlin 
can be caught with non-negligible probability 
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Specifically, in our QMA protocol, we ask a single Merlin to send the m quantum proofs of the 
original BellQMA protocol (denoted by a single state \ip)), accompanied by a "consistency-check" 
string p which is a classical description of the probability distributions obtained as the output 
of Stage 1. One can think of this as having the QMA verifier delegate Stage 1 of the BellQMA 
verification to Merlin. Arthur then performs a consistency check between \tp) and p based on 
the premise that if Merlin is honest, then p should arise from running Stage 1 of the original 
verification on \tp). If this check passes, then Arthur runs Stage 2 of the BellQMA verification 
on p. If Merlin tries to cheat, however, we show that the check detects this with non-negligible 
probability. Note that the accuracy of the consistency check crucially uses the fact that there are at 
most polynomially many outcomes to check for each local measurement of Stage 1. 

Finally, our last result, Theorem 11.41 is shown using duality theory for a class of cone pro- 
grams that captures the success probability of a QMA(poly) protocol. In particular, we phrase the 
maximum acceptance probability of a (possibly cheating) prover for the two-fold repetition of a 
SepQMA(m) verification protocol as a cone program. We then demonstrate a feasible solution for 
its dual yielding an upper bound on the maximum acceptance probability. The objective value 
of this dual solution is precisely the product of the optimum values of the two instances of the 
SepQMA(m) verification protocols. We conclude that one of the optimal strategies of the provers 
is to be faithful in the following sense: Each prover elects not to entangle his/her two quantum 
proofs for the two instances of the SepQMA(m) protocol and instead sends a tensor product of 
optimal proofs for both the instances. 

Previous work. The expressive power of multiple Merlins was first studied by Kobayashi, Mat- 
sumoto and Yamakami IKMY03L who showed that QMA(2) = QMA(poly) if and only if the 
class of QMA(2) protocols with completeness c and soundness s (with at least inverse polynomial 
gap) is exactly equal to QMA (2) protocols with completeness 2/3 and soundness 1/3. A substan- 
tial amount of research has since been devoted to understanding the properties of multi-prover 
quantum Merlin- Arthur proof systems. Recently, Harrow and Montanaro IIHM10II demonstrated 
a product state test, wherein given two copies of a pure quantum state on multiple systems, the 
test distinguishes between the cases when the quantum state is a fully product state across all 
the systems or far from any such state. Using this test, they answered a few important questions 
regarding QMA(poly). In particular, they showed that 

QMA(2) = QMA(poly) 

and that error reduction is possible for such proof systems. Prior to their result, the answers to 
both the questions were known to be affirmative assuming a weak version of the Additivity Con- 
jecture |ABD + Q9l . One of the crucial properties of the product state test is that it can be converted 
into a QMA (2) protocol, where Arthur's measurement operator corresponding to outcome accept 
is a separable operator across the two proofs. Harrow and Montanaro established a perfect paral- 
lel repetition theorem for such proof systems, a crucial step in obtaining exponentially small error 
probabilities. 

Blier and Tapp initiated the study of logarithmic-size unentangled quantum proofs [BT09J. They 
showed that two unentangled quantum proofs suffice to show that a 3-coloring of an input graph 
exists, implying that NP has succinct unentangled quantum proofs. A drawback of their protocol 
is that although it has perfect completeness, its soundness is only inverse polynomially bounded 
away from 1. Shortly after, Aaronson, Beigi, Drucker, Fefferman and Shor ||ABD + Q9l showed 
that satisfiability of any 3-SAT formula of size n can be proven by 0(\/~n) unentangled quantum 
proofs of O(logn) qubits with perfect completeness and constant soundness (see also IICD10II ). In 
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a subsequent paper MBei08L Beigi improved directly on Blier and Tapp's result BBT091I by showing 
that by sacrificing perfect completeness, one can show that NP has two logarithmic-size quantum 
proofs with a better gap between completeness and soundness probabilities than in IIBT09L 

Finally one of the open questions raised in Ref. ||ABD + 09| concerns the power of Arthur's 



verification procedure. In particular, the paper introduces two different classes of verification 
procedures, BellQMA and LOCCQMA verification. Roughly speaking, LOCCQMA verification 
corresponds to Arthur applying a measurement operation that can be implemented by Local 
Operations and Classical Communication (LOCC) (with respect to the partition induced by the 
multiple proofs). The authors raised the question of whether BellQMA(poly) = QMA or not. 
Brandao IBra08ll showed that BellQMA(m) is equal to QMA for constant m. In a recent develop- 
ment, Brandao, Christandl and Yard RBCYIU showed that LOCCQMA (m) is equal to QMA for 
constant m. 

Organization of this paper. We begin in Section [2] with background and notation, defining rel- 
evant complexity classes in Section [2TTI and reviewing cone programming in Section |Z21 The- 
orems 11.11 11.31 and 11.41 are proved in Sections El HI and |5[ respectively. We conclude with open 
problems in Section [U 

2 Preliminaries and Notation 

We begin by setting our notation, and subsequently review the background material required 
for this paper. First, the notation [m] indicates the set {1, . . . ,m}, and \x\ the length of a string 
x G {0, 1}* . We let uppercase script letters X , y, Z denote complex Euclidean spaces. We denote 
the sets of linear, Hermitian, positive semidefinite, and density operators acting on vector space X 
by L (X), Herm (X), Pos (X), and D (X), respectively. We denote the standard Hilbert-Schmidt 
inner product of operators A and B as (A, B) := Tr(A*B), where A* denotes the adjoint of A. The 
spectral and trace norms of an operator A are given by || A || ^ := max{|| Au\\ : \\u\\ = 1} and 
ll^lltr := T r (V A* A), respectively, where ||w|| denotes the Euclidean norm of a vector u. These 
can be thought of as the largest singular value and the sum of singular values of A, respectively. 
A useful lemma in this paper regarding the trace norm is the following: 

Lemma 2.1 ([WatQ2|). Let {p 1 ...,p k } C D (X) and {a lf ...,cr k } C D (X). Then 

k k k 



i=l i=i 



t II tr ' 

tr (=1 



Next, we say a (possibly unnormalized) operator A E Pos (X\ ® • • ■ ® X m ) is fully separable if 
it can be written as 

k 

A = Y^P!{i)®---^P m (i) r 

i=l 

where Py(z') G Pos (<-f/), for every ; G [m] and i G [k]. The set of fully separable operators is de- 
noted Sep (Xi, X2, . . . , X m ). This notation is helpful in the context of cone programming. In the 
setting of quantum information, one typically also has Tr(A) = 1. The set of fully separable den- 
sity operators is convex, compact, and has non-empty interior since it contains a ball around the 
normalized identity operator IIGB021 IGB031 IGB051I . Using these facts, one can formally prove that 
the problem of determining whether a given density operator is close to the set of separable states 
is NP-hard for the bipartite case (i.e., m = 2) MGur03UIoa07UGhalOUBei08|l . For the multipartite case 
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(i.e., m > 2), the classification of entanglement becomes much more intricate due to, e.g., notions 
of partial separability (hence our use of the term fully separable) — see the reference DHHHH09I for 
a comprehensive survey. 

We use the fact that any pure quantum state \xp) £ C N can be described approximately classi- 
cally using N ■ f(N) bits, for some function / : N — > IN. The resulting approximate description 
\ip') satisfies || \ip) — \xp') \\ < N2~(f( N ^ +1 \ We also speak in terms of quantum registers rather than 
quantum states in the next two sections. To make the association precise, an n-qubit quantum 
register X is associated with a vector space X = C 2 and contains any element of D (X). 

Finally, moving to quantum operations, the notion of measurement used in this paper is that 
of a Positive Operator Valued Measure (POVM), given by a finite set of positive semidefinite op- 
erators {TIi, . . . , Yl r } C Pos (X) obeying 
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Regarding unitary operators, we use the fact that any unitary operator acting on k qubits can 
be approximated within high precision by a finite set of one-qubit, two-qubit, and/ or three-qubit 
unitary operators. Such a finite set is often referred to as an approximately universal set of quantum 
gates, and one such set is comprised of the Toffoli, Hadamard, and phase-shift gates. The Solovay- 
Kitaev theorem implies that the action of an arbitrary unitary operator U on k qubits can be simu- 



lated by a composition U of 0(4* log(l/e)) universal gates, such that 



2.1 Relevant quantum complexity classes 



U-U 



< e llNCOul 



A promise problem A = (A yes/ A no ) is a partition of the set {0,1}* into three disjoint subsets: 
the set Ay es denotes the set of YES-instances of the problem, the set A no denotes the set of NO- 
instances of the problem, and the set {0, l}*\(A yes U A n0 ) is the set of disallowed strings (we are 
promised the input does not fall into this last set). 

Definition 2.2 (QMA(m)). Let p : N — > N be a polynomially bounded function, and m : N — > N a 
function. A promise problem A = (A yes ,A no ) is in the class QMA(m) if there exists a polynomial-time 
generated family of verification circuits Q = {Q n | n GN} with the following properties: 

1. Each Q n acts onn + p(n) input qubits, and outputs one qubit. 

2. (Completeness) For every x S A yes , there exist m(\x\) quantum proofs , \tp2) \ip m (\x\)) e 
£ 2P{M) suc h that 

Pt[Q\ x \ accepts (x, |t/>i) ® . . . ® \xp m{ \ x[] ))} > 2/3. 

3. (Soundness) For any x G A no and any m(\x\) quantum proofs \ipi) , \ip2) , . . ., \ip m (\x\)) e C 2P<W) 

Pr[Q| x | accepts (x, \ipi) <g> . . . ® |^ m( | x |)))] < 1/3. 

Furthermore, the class QMA(poly) is defined as QMA(poly) = Umepoly QMA(m). 

We remark that the constants 2/3 and 1/3 can be replaced by any a, b > 0, respectively, such 
that a — b > l/poly(n). This does not change the expressive power of the proof system. All 
complexity classes considered in this paper are variants of QMA(m) and satisfy the properties 
mentioned above in Definition 12.21 We define the following classes, which are relevant to this 
paper. 
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1. [QMA and MQA] The class QMA is simply QMA(l). If we replace the quantum proofs in 
the definition of QMA with a polynomial-size classical proof string, the corresponding class is 
denoted MQA. 

2. [SepQMA(poly)] The class SepQMA(poly) is a subclass of QMA(poly), wherein Arthur's 
measurement operator corresponding to outcome accept is a fully separable operator across 
the proofs. 

3. [QMAi og (poly)] The class QMA log (poly) is a subclass of QMA(poly), wherein each Merlin's 
message to Arthur is 0(log(|xQ) qubits in length. 

Finally for clarity, we opt to give a more formal definition of BellQMA [r, m] . 

Definition 2.3 (BellQMA [r, m]). Let r, m : N — > N be two functions. A promise problem A = 
{Ay es , A no ) is in class BellQMA[r, m] if there exists a QMA(m) verification protocol in which Arthur 
is restricted to act as follows. 

1. Arthur performs a polynomial-time quantum computation on the input x and generates a description 
of quantum circuits V\(x), . . . , V m (x), one for each of the m provers. 

2. (Stage 1) Arthur simultaneously measures all m quantum proofs by applying V{{x) to the i-th quantum 
proof, where the action ofV{(x) can be described by a unitary operator followed by measurement in the 
standard basis. The label of the i-th measurement outcome is stored as a classical string j/j also identified 
as an element of\r(\x\)\. 

3. (Stage 2) Arthur runs an efficient quantum verification circuit on input x and measurement outcomes 
(yi, . . . ,y m ) to decide whether to accept or reject. 

Note that the key distinction between BellQMA [r,m] and BellQMA (poly) is that the former has 
the number of measurement outcomes in Stage 1 of the protocol bounded by whereas 
the latter may allow exponentially many possible outcomes. Throughout this paper, we use the 
notation BellQMA [poly, poly] to denote 

BellQMA [poly, poly] := |J (J BellQMA [r,m]. 

rGpoly mepoly 

We remark that, as in IICD10L our BellQMA protocols are allowed to use a quantum verification 
circuit in Stage 2, whereas originally in references BBra08t lABD + 09l only classical processing of 



measurement outcomes { y ; - } was allowed in order to emulate the notion of a Bell experiment per- 
formed by Arthur. We again remark that Theorem 11.31 holds even if Arthur is restricted to do 
classical processing on the measurement outcomes. 



2.2 Cone programming 

We now briefly review basic notions in conic optimization (or cone programming), which is a 
generalization of semidefinite optimization. We say that a set K in an underlying Euclidean space 
is a cone if x G K implies that Ax G K for all A > 0. A cone K is convex if x, y G K implies that 
x + y G K. Cone programs are concerned with optimizing a linear function over the intersection 
of a convex cone and an affine space. It generalizes several well-studied models of optimization 
including semidefinite programming (K = Fos(X)) and linear programming (K = R" ). In this 
paper, we are primarily concerned with the cone of fully separable operators Sep {X\, Xi, . . . , X m ) 
which recall is a closed, convex cone with non-empty interior. 
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Associated with a cone K is its dual cone K* defined as 

K* = {S : (X, S) > for all X G K} . 

A cone program associates the following 4-tuple (C, b, A, K) to an optimization problem described 
as: 

supremum: (X, C) 
subject to: A(X) = b, 
XGK, 

where A : Span(K) — > R m is a linear transformation. Note that the inner product is defined as in 
the Euclidean space. For instance, if the cone under consideration is the set of positive semidefinite 
or separable operators, then the inner product is the standard Hilbert-Schmidt inner product over 
the space of Hermitian operators. We say that the cone program is feasible if {X : A(X) = b} n K 
is non-empty and strictly feasible if {X : A(X) = b} fl int(X) is non-empty where int(-) denotes 
the interior of a set. 

Cone programs come in primal-dual pairs: 

Primal problem (P) Dual problem (D) 

supremum: (X, C) infimum: (b,y) 

subject to: A(X) = b, subject to: A*{y) = C + S, 

X G K, S ef, 

where A* is the adjoint of A. A convex cone K is closed if and only if K = K**. In other words, 
the dual of the cone K* is the original cone K. Thus, if K is not closed we need to "order" the 
primal-dual pairs since K ^ K** implying the dual of the dual problem is not equal to the primal 
problem. Since the convex cone of fully separable operators is closed, ordering the primal-dual 
pairs is not an issue in our case. 

Similar to linear programming and semidefinite programming, cone programming has a rich 
duality theory. 

Lemma 2.4 (Weak Duality). IfX is primal feasible and (y, S) is dual feasible then 

(b,y)-(X,C) = (X,S) >0. 

This result can be used to show upper bounds on the value of the primal problem or lower 
bounds on the value of the dual problem. There is also a notion of strong duality. We say that 
strong duality holds for a problem (P) if the optimal value of (P) equals the optimal value of (D) and 
(D) attains an optimal solution. Below we give a condition that guarantees strong duality for (P). 

Theorem 2.5 (Strong Duality, Version 1). If (P) is strictly feasible and the optimal value is bounded from 
above, then strong duality holds for (P), i.e., (D) attains an optimal solution and the optimal values for (P) 
and (D) coincide. 

In this paper, we are concerned with closed, convex cones with non-empty interior. Since 
the dual of the dual problem is the primal problem when K is closed, we can use the following 
stronger version of strong duality. 

Theorem 2.6 (Strong Duality, Version 2). Suppose K is a closed, convex cone. If (P) and (D) are both 
strictly feasible then strong duality holds for both problems, i.e., both problems attain an optimal solution 
and the optimal values coincide. 

We refer the reader to the work of Tunnel and Wolkowicz IITW08II and the references therein 
for more details on cone programming duality. 



9 



3 Equivalence of MQA and QMA log (poly) 



We now prove Theorem 1 1.1 1 i.e., that MQA = QMAi og (poly) . We first show the direction MQA C 
QMA log (poly). Let A = (A yes , A no ) be a promise problem in MQA and let x G {0, 1}" be the input 
string. Suppose the MQA prover sends an m-bit classical proof to the verifier, for polynomially 
bounded m. Then the following simple QMAi og (m) protocol achieves the desired containment: 

QMA log (m) Protocol 

1. Embed classical bits into qubits. Each (unentangled) prover i G [m] sends a single qubit 
\ipi) G C 2 to Arthur. If the z-th prover is honest, his/her qubit is the computational basis state 
corresponding to the z-th bit of the classical MQA proof. 

2. Make things classical again. Arthur measures all proofs in the computational basis, obtaining 
a classical string y G {0, l}" 1 . 

3. Run MQA verification. Arthur runs the MQA verification circuit on x and y and accepts if 
and only if acceptance occurs in the MQA verification. 

The completeness property follows straightforwardly. The soundness property is also easy to 
observe. Note that Arthur runs the MQA verification on a classical string y and hence he accepts 
the string with probability at most 1/3. 

To show the reverse containment, let A = (A yes , A n0 ) be a promise problem in QMA log (poly) 
and let x G {0, 1}" be the input string. Suppose we have a QMAj og (m) protocol for polynomially 
bounded m, where prover i sends a [c log n] -qubit state \ipi) for some constant c > 0. Let 

The MQA protocol proceeds as follows: 

MQA Protocol 

1. Describe proofs classically. The prover sends m classical registers represented by the tuple 
(Ci, C2, • • • , Cm)/ each of length 2n ■ r(n) to Arthur. If the prover is honest, register C, contains 
a classical description of the z-th quantum proof of the QMAi og (m) protocol. 

2. State preparation. Using the contents of register Q, for every choice of i G [m], Arthur pre- 
pares the state \tpj) by first determining a unitary IT; such that Uj |0 ... 0) = \xpj), and then 
implementing ii, with high precision using a finite set of approximately universal gates, ob- 
taining states \ip'j). 

3. Run QMAi og (m) verification. Arthur runs the QMA log (m) verification circuit on \ip'^) <g> • • • <g) 
\ip' m ) and accepts if and only if acceptance occurs in QMAi og (m) verification. 

Observe that each classical register Q is of size polynomial in n, implying the overall proof length 
is of polynomial size. In Step 1, the prover uses n bits to represent the real and imaginary parts 
of each of the polynomially many entities (r(n) entries) required to describe each \xp) . Let the unit 
vector described by register Q be denoted In Step 2, !i, is easily found as the unitary that 
maps |0 ... 0) to \xpi) as the inverse of the unitary that maps \xpi) to |0 . . . 0). Such a unitary can be 
easily decomposed into a product of polynomially many 2x2 rotations on an r(n) -dimensional 
real space and a diagonal unitary as follows. The first step is to convert the vector \tpi) into a 
real vector by applying an appropriate diagonal unitary operator. The second step is to convert 
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the resulting real unit vector into |0 . . . 0) by shifting the amplitudes of any standard basis other 
than |0 . . . 0) to |0 . . . 0). Each of these unitary operators can be implemented by a finite set of 
approximately universal gates (see Bernstein and Vazirani BBV97I1 for details). This step also incurs 
some error, which can be made exponentially small. 

Since Steps 1 and 2 can be performed to within inverse exponential error, we thus can ensure 
II Itfi) ~ Wi) II — 6 Ior a U z ^ [ m ] an d for inverse exponential e > 0. By Lemma 12.11 it follows 
that the overall precision error is at most me for polynomial m, and thus the completeness and 
soundness of the protocol are bounded from below and above by (respectively) 

2 1 

me and — h me. 

3 3 

Alternatively, the containment QMAi og (poly) C MQA can be shown using a slightly differ- 
ent protocol, where Merlin sends classical descriptions of the quantum circuits that generate the 
quantum proofs from |0 . . . 0) instead of classical descriptions of the proofs. 



4 Equivalence of BellQMA [poly, poly] and QMA 

We now show Theorem ll.3l i.e., that BellQMA [r, m] = QMA for polynomially-bounded functions 
r and m. For notational convenience, let riy(z) denote Arthur's 2-th POVM element in Stage 1 of the 
BellQMA verification protocol for the y'-th prover (i.e. YJi=\ HyO) = 1)/ where we assume without 
loss of generality that the number of possible outcomes is exactly r for each prover, and where 
j G [m] for m the number of provers. 

We proceed as follows. Let A = ( A yes , A no ) be a promise problem, and x be an input string of 
length n := \x\. As mentioned in Section [TJ the containment QMA C BellQMA [poly, poly] follows 
straightforwardly since QMA C BellQMA [2, 1]. For the reverse containment, suppose we have 
a BellQMA [r, m\ protocol for polynomially bounded functions r, m : N — > N with completeness 
2/3 and soundness 1/3. We show that this protocol can be simulated by a QMA protocol where 
Merlin sends the following proof to Arthur. 

Merlin sends two registers (X, Y), which should be thought of as the classical and quantum 
registers, respectively. Suppose optimal proofs for the BellQMA [r,m] protocol for input x are 
given by py for j G [m] . Then, in the quantum register Y, an honest Merlin should send many 
copies of the state py. Specifically, Y is partitioned into m registers Yy, one for each original prover, 
and each Yy should contain k copies of py, for k a carefully chosen polynomial. In other words, Y 
should contain the state [pf^Yj <8> ■ ■ ■ <8> \Pm k ]y m - We further view each Yy as a block of registers 
(Vj, . . . , Yj) where Yj should contain the Z-th copy of py. 

In the classical register X, an honest Merlin prepares a quantum state in the computational 
basis, which intuitively corresponds to a bit string describing the m classical probability distribu- 
tions Arthur induces upon applying the measurement operation corresponding to Stage 1 of the 
BellQMA verification to each of the optimal proofs pj, respectively. 

More formally, we partition X into mr registers Xj corresponding to each of the j G [m] provers 
and i G [r] POVM outcomes per prover. The content of Xj should be Pj(i) ■= (TLj(i),pj), truncated 
to ol bits of precision (a polynomially bounded), such that U =1 Py(i') = 1. For example, if the y'-th 
prover 's proof was the single qubit state py = |0) (0|, with Fly(l) = |0) (0| andITj(2) = |1) (l|,then 
Xj = (1,0). We remark that X plays the role of the classical "consistency check" string described 
in Section [TJ 

2 This protocol was mentioned to us by Richard Cleve. 
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Of course, Merlin may elect to be dishonest and choose not to send a proof of the above form 
to Arthur by, e.g., sending a quantum state which is entangled across the registers (X, Y) . To catch 
this, our QMA protocol is defined as follows: 

QMA Protocol 

1. Merlin sends Arthur a quantum state in registers (X, Y), for X and Y defined as above. 

2. Force X to be classical. Arthur measures register X in the computational basis and reads the 
measurement outcome. This forces X to essentially be a classical register of bits, and destroys 
any entanglement or correlations between X and Y. 

3. X should contain probability distributions. Arthur checks whether the content of registers Xj 
form a probability distribution pj, i.e., that Yli=\ = 1- Arthur rejects if this is not the case. 

4. Consistency check: Can the quantum states in Y reproduce the distributions in X? Arthur 
picks independently and uniformly at random, an index j G [m] and another index i £ [r] . 
He applies the measurement {IL(z) }J =1 separately to each register Yj, . . . , Yj, and counts the 
number of times outcome i appears, which we denote henceforth as n;(i). Arthur rejects if 



-ir-Vjif) 



i 

> ~ 
v 



for p a carefully chosen polynomial. 

5. Run Stage 2 of the BellQMA verification and repeat for error amplification. For each prover 
j, Arthur samples an outcome from [r] according to the distribution in (Xj, ...,X r -), and runs 
Stage 2 of the BellQMA verification on the resulting set of samples. He repeats this process 
independently a polynomial number of times q, and accepts if and only if the BellQMA proce- 
dure accepts on the majority of the runs. 

Let us give an intuition behind the above verification procedure. The key step above is Step 
4, where Arthur cross-checks that the classical distributions sent in X really can be obtained by 
measuring m quantum proofs, which for an honest Merlin should be unentangled. In this sense, 
our protocol can alternatively be viewed as using quantum proofs (Y) to check validity of a classical 
proof (X). Intuitively, the reason why entanglement in Y does not help a dishonest Merlin in Step 
3 is due to the local nature of Arthur's checks /measurements. Finally, once Arthur is satisfied 
that X contains valid distributions, he runs Step 5. We remark that repetition is used here in order 
to boost the probability of acceptance in the x G A yes case to exponentially close to 1, which is 
required to separate it from the X G A no case, where the probability of catching a dishonest Merlin 
is only inverse polynomially bounded away from 1. Once such a gap exists, standard amplifi- 
cation techniques IIKW001 IMW05B can be used to further improve completeness and soundness 
parameters. 

To formally analyze completeness and soundness of the protocol, we assign the following 
values to the parameters mentioned above, all of which are polynomial in n in our setting: 

q = 50n and p = 20mr and k = 5p 3 and a. = 20nmr. 



Completeness. Intuitively, when x G A yes , Merlin passes Step 4 with probability exponentially 
close to 1 since he has no incentive to cheat — he can send an unentangled proof in Step 1 to Arthur 
corresponding to the optimal proofs pj in the BellQMA protocol, such that the expected value of 
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rij(i)/k is indeed Pj(i). Arthur's checks in Step 4 are then independent local trials, allowing a 
Chernoff bound to be applied. We then show that Merlin passes each run in Step 5 with con- 
stant probability, and applying the Chernoff bound a second time yields the desired completeness 
exponentially close to 1 for the protocol. 

To state this formally, suppose Merlin is honest and sends registers (X, Y) in the desired form, 
i.e., X- contains pj(i) = (TLj(i),pj) up to a bits of precision, and Yj contains pj. Then, the expected 

value of the random variable ny(z') is E[n ; -(z')] = k (Tlj(i),pj), which is equal to k ■ pj(i) up to the 
error incurred by representing Pj(i) using a bits of precision. In other words, 



E[«;(0] 



P;(*') 



1 1 

< 2* < 2p' 



(1) 



We can hence upper bound the probability of rejecting in Step 3 by 



Pr 



nj(i) 



> 



< Pr 



rijii) E[n y (f)] 



> 



2p 



< 2exp 



5p 



where the first inequality follows from Eq. (fTJ and the second from the Chernoff bound. Thus, 
Merlin passes Step 4 with probability exponentially close to 1. 

We now turn to the final step. Since x G A yes , we know that the optimal distributions, denoted 
qi := ((IL(l),pj) , . . ., (TLj(r),pj)) for 7 E [m], obtained in Stage 1 of the original BellQMA pro- 
tocol are now accepted in Stage 2 with probability at least 2/3. However, in our case, Merlin was 
only able to specify each q; up to a bits of precision per entry as the distributions p;. To analyze 
how this affects the probability of acceptance, let Pj and Qj be diagonal operators with entries 
Pj(i,i) = Pj{i) and Qj(i,i) = (TLj(i),pj), respectively. Letting A accep t denote the POVM element 
corresponding to outcome accept in Stage 2 of the BellQMA protocol, we thus bound the change 
in acceptance probability by: 



Tr 



A 



accept 




< 



;=1 /=1 



< L\\ p i-Qik 

m r 

= EElP;(0-W)^-; 

1=1 i=l 

mr 



< 



220nmr ' 



B|| tr and the second 



where the first inequality follows from the fact that |Tr(AB)| < || 

inequality follows from Lemma l2Tl Therefore, the probability of success for each of the q runs of 
the BellQMA protocol in Step 5 is at least 



mr 



220ii r. 



> 0.6. 



Since each run is independent, applying the Chernoff bound yields that Arthur accepts Merlin's 
proof in Step 5 with probability at least 1 — 2exp(— 0.02^), as desired. There may be some error 
incurred in sampling, which can be assumed to be exponentially small so that the success proba- 
bility of each run is still at least 0.6. 
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Soundness. We now prove that when x G A no , a dishonest Merlin can win with probability 
at most inverse polynomially bounded away from 1. To show this, we bound the probability 
of passing Step 4 by relating the quantity py(z') to the expected value of rij(i)/k, and then apply 
the Markov bound. The desired relationship follows by observing first that the expected value 
of rij(i)/k is precisely the probability of obtaining outcome z when measuring proof j of some 
(honest) unentangled strategy followed by arguing that the distribution py must hence be far 
from this latter (honest) distribution if Merlin is to pass Step 5 with probability at least 1/2 (since 
x G A no ). Combining these facts, we find that Arthur detects a cheating Merlin with inverse 
polynomial probability in Step 4. 

More formally, let the quantum register Yy contain an arbitrary quantum state cry whose re- 
duced states in registers Yj for I G [k] are given by cy (Z), and define 



1=1 



By the linearity of expectation, the expected value of the random variable nAi) Ik is 



E 



ny(0 



E<n / (0^/(0> = <n,-(i)/fy>. 



1=1 



Our goal is to lower bound the expression 

Pr "I® 



P;(0 



> 



(2) 



To achieve this, we first substitute py (z) above with a quantity involving E [zzy (i)/k], and then apply 
the Markov bound. 

To relate E[n,-(i)/fc] to py(z'), we first remark that in order for Merlin to pass each run of Step 
5 with probability exponentially close to 1, he must send probability distributions py, which are 
accepted by Stage 2 of the BellQMA verification with probability at least 1/2. Let 

gy(z):=(ny(zKy). 

Let us imagine a BellQMA protocol where the y'-th Merlin sends cjy as his quantum proof. Since 
x G A no , by the soundness property of the BellQMA(m) proof system, the success probability 
of the Merlins is at most 1/3. In other words, sampling outcomes from the probability distribu- 
tions (cjj(l), . . . ,CJj(r)) and then running the second stage of the BellQMA verification will yield 
outcome accept with probability at most 1/3. Also, observe that 



E 



ny(z) 



It follows that by letting Py and Qy be diagonal operators with the probability vectors py and qj on 
their diagonals, respectively, and A accept the POVM element corresponding to outcome accept in 
Stage 2 of the BellQMA protocol, we have 



1 

10 < 



Tr 



A 



accept 




Qi 



7=1 



< 



7=1 



<E 

7=1 



Q/l 
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Here, the (loose) lower bound of 1/10 comes from the following two observations. First, the 
distributions represented by the diagonal operators Qj's are derived from a BellQMA protocol 
and therefore achieve a success probability at most 1/3 by the soundness property of the Bel- 
lQMA verification. Second, the distributions represented by the diagonal operators Pj's have to 
achieve a success probability strictly greater than 1/2 per run to guarantee that Merlin wins Step 
5 with probability exponentially close to 1. Combining these two, we get that the difference be- 
tween the success probabilities obtained by distributions described by operators { Pj : / G [m] } 
and { Qj : j G [m] } should be at least 1/6 modulo the error incurred due to finite precision when 
encoding the distributions pj. The use of the constant 1/10 overcompensates for this precision 
error. Hence, there exists a / such that 

ll p ;-Q/ll tr = ElP;(0-^(OI> 



i=l 



10m' 



implying the existence of an i such that 



IP;(*') -9/(01 



> 



lOmr 



(3) 



This is our desired relationship between py(z') and E[n<(i)/fc] = cjj(i). Note that the probability of 
picking pair in Step 4 is 1/ mr. 

We now substitute this relationship into Eq. (0 and apply the Markov bound. Specifically, 
choose i and j as in Eq. (J3J), and assume that pj(i) > (n ; (z), £;). Then, we have 



Pr 



ny(z') 



p/(0 



1 

< - 
V 



< Pr 



nj(i) 



E 



n ; (z) 



> 



lOmr 



^1 — — — . 
2p 



The case of py(j') < (n,-(l), £,■) is similar. We conclude that a dishonest Merlin is caught in Step 4 
with probability at least l/2p. Therefore, the probability that Arthur proceeds to Step 5 is upper 
bounded by 

^'x-J-Wi-i-Wi 1 



mr 



20mr 



mr 



20m 2 r 2 ' 



where the first term represents the case where Arthur selects the correct pair to check, and 
the second term the complementary case, in which we assume the cheating prover can win with 
probability 1. Hence the overall success probability of a dishonest Merlin is at most 1 — l/20m 2 r 2 , 
which is bounded away from 1 by an inverse polynomial. 

Finally, as mentioned before, since m and r are polynomially bounded functions, we have that 
the completeness is exponentially close to 1, while the soundness is bounded away from 1 by an 
inverse polynomial. By known amplification techniques for QMA protocols IIKWOOt fMW05ll , one 
can amplify the completeness and soundness errors to be exponentially close to 0. This proves our 
desired containment. 



5 Perfect parallel repetition for SepQMA(poly) 

We now show Theorem ll.4[ i.e., that the class SepQMA(m) admits perfect parallel repetition. Be- 
fore we proceed, recall that the closed convex cone Sep . . . , X m ) is defined to contain operators 
of the form 

k 

£Pi(i)®---®P m (i), 

i=l 
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where P;(i) G Pos (Ay), for every j G [m] and i G [fc]. This is the cone of interest and it is known 
to be closed and convex with non-empty interior. Given C to be the measurement operator corre- 
sponding to outcome accept, the maximum success probability of the Merlins in any QMA(m) pro- 
tocol can be written as the maximum of (p, C), where p is a density operator in Sep {X\, . . . , X m ). 
By standard convexity argument, one can always assume that the maximum is achieved by a pure 
product state. 

For the remainder of the section, it will be convenient for us to distinguish two instances of 
SepQMA(m) protocols as the first and second protocol. For the first SepQMA(m) protocol we 
can write the maximum acceptance probability as the optimal value of the primal problem in the 
following primal-dual pair (where the operator C\ is Arthur's POVM element corresponding to 
outcome accept): 



Primal problem (Pi) 
maximize: (p\,C\) 
subject to: Tr(pi) = 1, 

p\ G Sep {X\, . . . , X„ 



Dual problem (Di) 
minimize: t\ 

subject to: t{l x = Q + W lr 

Wi eSep*(X 1 ,...,X m ), 



where X denotes X\ 
programs since 



■ ■ ■ <S> X m . The use of "maximum" and "minimum" is justified in the above 



are strictly feasible solutions for (Pi) and (Di), respectively IIGB02llGB03UGBd5ll . Hence, by The- 
orem |2]6l strong duality holds for both problems, i.e., both problems attain an optimal solution 
and the optimal values are the same. We can similarly formulate the acceptance probability of the 
second protocol as 



Primal problem (P2) 



Dual problem (D2) 



maximize: (p2, C 2 ) minimize: £2 

subject to: Tr(j02) = 1, subject to: t 2 ly = Cz + W2, 

pi g Sep (y lr . . . , y m ) , w 2 g se P *(^i, .... y m ), 

where y denotes ^1 ® • • • ® y m . Since we are considering SepQMA protocols it holds that 

Ci G Sep (X\, . . . ,X m ) and C 2 G Sep (y lf . . . , y m ) . 

Given the two cone programs above, the maximum acceptance probability of the two-fold 
repetition of the protocol can hence be expressed as 



maximize: 
subject to: 



Primal problem (P) 

(p, Ci ® Cz) 
Tr(p) = 1, 
p G Sep (Xi 



y 1 ,...,x m ®y„ 



Dual problem (D) 

minimize: t 

subject to: t\x®y = Ci ® C2 + W, 



Note that the operators p and W are elements of Herm (X\ <g) • • • ® X m ® y\ ® ■ ■ ■ (g> y m ). 

To show Theorem II .41 observe that if p\ and pz are any respective optimal solutions of (Pi) 
and (P2), then p\ <g> p 2 is a feasible solution of (P). Therefore the optimal value of (P) is at least the 
product of the optimal values of (Pi) and (P2). It remains to show that in fact no other strategy for 
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the prover can perform better than this honest strategy. To do so, we demonstrate a dual feasible 
solution for (D) attaining this same objective value. 

More formally, let (t\, W\) and (tj, W 2 ) be respective dual optimal solutions of (Di) and (D2). 
By strong duality, t\ is the optimal value of (Pi) and £2 is the optimal value of (P2)- We show that 
£1 • £2 is an upper bound on the optimal value of (P) by exhibiting a solution (£1 • £2, W) which is 
feasible in (D), for some W G Sep*(X\ ® ^1, . . ., X m ® y m ). We first prove the following useful 
lemma. 

Lemma 5.1. For complex Euclidean spaces X\, . . . , X m and y\, . . . , y m , the following two containments 
hold: 

• Sep*(X lf ...,X m )® Sep(^, ...,y m )C Sep*(Ai ®y x ,...,X m ® y m ), and 
. Sep(#i, . . . , # m ) ® Sep* (y lt ...,y m )Q Sep*(Ai ® 3>i, ■ ■ * m ® y m ). 

Proof. We prove the first condition as the second is nearly identical. Fix W G Sep*(X\, . . . , X m ) 
and C G Sep(J ; i, • • • , 3^)- Then for S G Sep(Af 1 ® 3^, . . . , X m ® 3m)/ we have 

(W®C,S) = (W,Tr y [S(1*®C)]> >0, 

if Try [S(l x ® C)] G Sep(A' 1 , . . . , X m ). Therefore, it suffices to prove that Try [S(l x ® C)] G 
Sep( Ai, . . . , A^i). To this end, let 

km k' m 

S = E0MO and C = ^0^) ( 

i=l j=i 7=1 ;=i 

where p,-(Z) G Pos (Af z (8) 3^) and Py(Z) G Pos (3>/) for all z G [k], j G [fc'], and I G [m]. Now we can 
write Try [S (l x ® C)] as 



Tr 



3' 



'it b \ / V m N 

,i=l Z=l / V 7=1 Z=l , 



It V m 

»=i 7=1 fc=i 



which is clearly in Sep (X\,..., X m ). This concludes the proof. □ 

We now use Lemma 15.11 to construct two operators in Sep*(Ai ® 3^1, • • • , ® 3^w)/ the ap- 
propriate convex combination of which is the dual feasible solution we are seeking. Specifi- 
cally, observe first that since for the two instances of the SepQMA(m) protocol, we have C\ G 
Sep {X\, . . . , X m ) and C2 G Sep (3\, . . . , y m ), and since t x and ly are fully separable operators, it 
follows that 

hl x + Ci G Sep(Af a , . . . , * m ) and £2% + C 2 G Sep(^, . . . , y m ) 
for all £1, £2 > 0. Using Lemma 1531 we thus obtain operators 

(hl x - Ci) ® (t 2 ly + C 2 ) G Sep* ® y lf . . ., X m ® y m ) (4) 

and 

(£ii^ + Ci)® (£ 2 i y -c 2 ) esep*(x 1 ®y 1 ,...,x m ®y m ), (5) 

where £il^- — C\ G Sep*(Ai, . . . , Af m ) by the constraints of (Di), and similarly for £2!^ — C2. Since 
Sep*(A'i ® 3^1/ ■ • • / X m ® 3^m) is a convex cone, it follows that the average of Eqs. © and © yields 
the desired operator 

W:=h- h t my - Ci ® C 2 G Sep*(#i ® 3>i, • • •, *m ® 3>m). 
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We conclude that (t\ ■ £2, W) is a feasible solution of the dual problem (D) with objective value 
t\ ■ £2 as desired. This concludes the proof of Theorem ll.4l 

We note that there are instances of QMA(poly) protocols, which are not SepQMA(poly) pro- 
tocols, that admit perfect parallel repetition. Although this fact is known in the literature (see 
Harrow and Montanaro IHM10I1 for details), we provide a concrete example below. 

First, note that the maximum acceptance probability of Arthur in a QMA(m) protocol is upper 
bounded by || C jj^, where C is the accepting measurement operator. Now, consider the two-qubit 
POVM operator 

C := i |00) (00| + 1|Y+)(Y+|, 

where 

|Y+) : = |oi> + -L |10) . 

We can easily check that C has two eigenvalues, and 1/2, and two principal eigenvectors 1 00) 
and |Y + ), one of which is a product state. It follows that the maximum acceptance probability is 
1/2. By the multiplicative property of the infinity-norm under tensor products, it holds that the 
maximum acceptance probability of the fc-fold repetition is exactly l/2 k . 

We now argue that C is not a separable operator. Suppose for the sake of contradiction that C 
can be written as 

n 

X] Pi ® Vi 

1=1 

for some pi, Wi G Pos(C 2 ). Then we have 

= (0,111) = E(i|p f |i>(ik i |i> / 

i=l 

which implies Pi\l) = or cr t • |1) =0 for all i G [n]. This leads to the contradiction 

\ = (C,\01)(10\) = f^(l\p i \0) (0| t r i |l>=0. 

Alternatively, one can show that C is not separable by observing that C has a non-positive partial 
transpose llPer96ilHTTH96l 

6 Conclusions and open problems 

In this paper, we have studied three variants of multi-prover quantum Merlin- Arthur proof sys- 
tems. We first showed that a system with polynomially many provers is indeed strictly more 
powerful than a single prover system if messages are restricted to be logarithmic in length, unless 
BQP = MQA. We next showed that polynomially many provers do not provide additional ex- 
pressive power over a single prover in the setting where the verifier is restricted to first applying 
unentangled and non-adaptive measurements with at most a polynomial number of outcomes 
per proof. Both of these questions make steps towards understanding the major open question of 
whether QMA with polynomially many provers is more powerful than QMA. Finally, we used 
cone programming duality to give an alternate proof of the fact that perfect parallel repetition 
holds whenever a QMA verifier's POVM element corresponding to accept is a fully separable op- 
erator. 
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A consequence of our first result is that the two variants of the class QMA(poly) / where Mer- 
lins send logarithmic-size proofs and Merlins send constant-size proofs are equal. A natural ques- 
tion concerning our first result is to understand the expressive power of the variant of QMA(poly), 
where Merlins are restricted to send poly log (|x|) qubits to Arthur. Another open question con- 
cerning the results presented in this paper is the relationship between BellQMA(poly) and QMA. 
We believe that understanding the complexity of BellQMA protocols, or more generally LOCC- 
QMA protocols, will shed new light on the bigger question pertaining to QMA(2) and QMA. An- 
other avenue of interest is to find further applications of the cone programming characterization 
of multi-prover quantum Merlin- Arthur proof systems. A straightforward question concerning 
the parallel repetition result presented in this paper is to investigate whether cone programming 
duality can be used to analyze the product state test in the Ref. BHM10L Another question one can 
ask is to find other classes of QMA(m) protocols that admit a perfect parallel repetition theorem. 
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